10086816 Results 



SEQ ID NO: 2 



SUMMARIES 

% 

Result Query- 
No. Score Match Length DB ID Description 



1 


2 84 9 


100 


. 0 


55 0 


2 1 


7V A VA A *? *7 Q 


Human organic sinio 


Z 


Z O 4 7 


100 


. 0 




z z 


HAB^i / Z / X 


Vi a T"1 Wrtmn oari a 




O ft *iO £ 
Z O .3 


99 


.4 


^ a i 

DO J 


z u 


A & Wft ft A A Q 


nutiLciii (J x y ctn -Lt- diixu 


ft 


iiJjO .3 


89 


.1 


3 3 X 


z u 


a a wr ftAftft 


KaL Organic cLL1Xv_>X1 


3 


o A <r i t; 

Z *t O J .3 


86 


.5 


CAE 
Oft 3 


z z 


HADJ O 3 3 -5 


noubc uiyainc aQio 




Xft it) 


49 


.7 


542 


22 


URA 79 74 


ViOa'T 1 '^ Wr^mo aani ^ 


n 

1 


X*fc 13 


49 


.7 


3*4 Z 


Z X 


An X -7Z 


n mud ii LercDidi uiy 


8 


1373 . 5 


48 


.2 


box 


1 8 


A IV U7/1 /II Qf 
AAW4 4 X y b 


Human osteoclast t 


9 


1371 


48 


.1 


537 


18 


AAW44195 


Mouse osteoclast t 


10 


1370 


48 


.1 


536 


21 


AAY92 903 


Rat cerebral organ 


11 


1287.5 


45.2. 


557 


24 


ABU54646 


Human NOVX polypep 


12 


1266.5 


44 


.5 


556 


24 


ABJ37918 


NOVX protein seque 


13 


1264 


44 


.4 . 


584 


23 


ABP70090 


Human N0V18a. Horn 


14 


1250 


43 


.9 


589 


23 


AAE13280 


Human transporters 


15 


1249.5 


43 


.9 


573 


24 


ABJ37917 


NOVX protein seque 


16 


1231 


43.2 


778 


24 


ABP54440 


Human TCH110 prote 


17 


1227.5 


43 


.1 


553 


22 


AAE10332 


Human transporter 


18 


1227.5 


43 


.1 


553 


23 


ABP70091 


Human NO VI 9a. Horn 


19 


1211.5 


42 


.5 


815 


22 


ABG2 68 99 


Novel human diagno 


20 


12 08 


42 


.4 


578 


22 


AAE06571 


Human protein havi 


21 


1173.5 


41 


.2 


553 


24 


ABG73329 


Human organic ion 


22 


1133 


39 


.8 


550 


22 


AAE06612 


Human protein havi 


23 


1133 


39 


.8 


550 


22 


AAB69091 


Human organic anio 


24 


1133 


39 


.8 


550 


23 


AAE16772 


Human transporter 


25 


1133 


39 


.8 


550 


24 


ABB82969 


Human SLC22A relat 



RESULT 1 




AAY44278 




ID 


AAY44278 standard; Protein; 550 AA . 


XX 






AC 


AAY44278; 




XX 






DT 


29-FEB-2000 


(first entry) 


XX 






DE 


Human organic 


anion transporter. 


XX 






KW 


Human organic 


anion transporter; hOAT; nephrotoxic 


KW 


drug-drug interaction; nucleotide phosphonate. 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Location/Qualifiers 


FT 


Peptide 


515. .528 


FT 




/labels Immunogen 


FT 




/notes "For producing antibodies" 


XX 






PN 


W09964459-A2 . 




XX 






PD 


16-DEC-1999 . 




XX 






PF 


10-JUN-1999; 


99WO-US13172 . 


XX 






PR 


ll-JUN-1998; 


98US-0088864 . 


PR 


03-MAY-1999; 


99US-0132267. 


XX 






PA 


(GILE-) GILEAD SCI INC. 


XX 






PI 


Cihlar T; 




XX 






DR 


WPI; 2000-097519/08. 



DR N-PSDB; AAZ2 93 00. 
XX 

PT Human organic anion transporter nucleic acid used in drug screening, 

PT identifying drug-drug interactions or individuals in whom particular 

PT drugs are nephrotoxic 
XX 

PS Claim 9; Fig 1; 3 6pp; English. 
XX 



CC The present sequence is human organic anion transporter (hOAT) protein. 

CC hOAT is expressed in kidney and brain and removes toxic anions 

CC from the circulation. It can be produced in host cells by transforming 

CC them with recombinant vectors containing nucleic acid encoding hOAT. 

CC It can be used in screening for compounds that suppress .or enhance anion 

CC uptake and transport by hOAT, to identify molecular variants of 

CC nephrotoxic compounds, to detect drug-drug interaction within kidney 

CC and brain and to test for transport of a covalently modified form of a 

CC nucleotide phosphonate analogue by hOAT. 



XX 

SQ Sequence 55 0 AA; 

Query Match 100.0%; Score 2 84 9; DB 21; Length 55 0; 

Best Local Similarity 100.0%; Pred. No. 2.6e-281; 

Matches 55 0; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


1 


MA FNDLLQQVGGVGRFQQ I QVTLWLPLLLMASHNTLQNFTAA I PTHHCRP PADANL S KN 


60 






IMIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1 


MA FNDLLQ Q VGGVGRFQQ I Q VTL WLPL LLMA SHNTL QNFTA A I PTHHCRP PADANL S KN 


60 


Qy 


61 


GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 


12 0 






IIIIIIIIMIIIIIIIIMIIIIIIIIIHIIIIIIIIIIIIIIIMIIIIIIIIIIII 




Db 


61 


GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 


12 0 


Qy 


121 


VTE WDL VC S HRA LRQL AQ S L YMVGVLLGAMVFG YL A DRLGRRKVL I LNYLQTA VS GTC A A 


180 






III INI II III II II II III II II II MINI III III II II II II III INI II II II 




Db 


, 121 


VTE WDL VC S HRA LRQL AQ S L YMVGVLLGAMVF GYL A DRLGRRKVL I LNYLQT A VSGT C A A 


180 


Qy 


181 


FAPNFPIYCAFRLLSGMALAGISLNCMTLNVEWMPIHTRACVGTLIGYVYSLGQFLLAGV 


240 






Ml 1 1 II 1 1 1 1 1 il II 1 1 1 1 i 1 1 1 1 II 1 1 IS 1 1 !l 1 1 1 1 !l 1 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 




Db 


181 


FA PNF P I YC A FRLL SGMA L AG I S LNCMTLNVE WM P I HTRA C VGTL I G YVYS LGQ FLL AGV 


240 


Qy 


241 


AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 


300 






1 1 1 1 1 III I II 1 II II 1 1 1 1 1 II II II 1 III III II II 1 II II II III 1 1 II MM MM 




Db 


241 


AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 


300 


Qy 


301 


GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 


360 






III II II 1 MINI II II III II II II II II II III II III II II II III II II II MM 




Db 


3 01 


GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 


360 


Qy 


361 


QGFGVS I YL I QVIFGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI CILLNGVI PQDQS 


420 






MIIMIMIIIMIIIIIIIIIIIIIMIMIMMIMMIIIIIIIIIIMIIIIM 




Db 


361- QGFGVS I YLIQVIFGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI CILLNGVI PQDQS 


420 


Qy 


421 


IVRTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGSIVSPLVSMTAE 


480 






MIIIIIIIMIIIIMIIMMMIMMMMMMMIIIMIMIIIMIMIIII 




Db 


421 


IVRTSLAVLGKGCLAASFNCI FLYTGELYPTMIRQTGMGMGSTMARVGS IVSPLVSMTAE 


480 


Qy 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRKGKQTRQQQEHQKYMVPL 


540 






M 1 1 1 1 1 II II 1 1 II 1 1 1 1 II 1 II M M 1 1 1 II M II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 II II 




Db 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRKGKQTRQQQEHQKYMVPL 


540 


Qy 


541 


QASAQEKNGL 55 0 

IMIIIMM 




Db 


541 


QASAQEKNGL 55 0 





RESULT 2 
AAB47271 

ID AAB47271 standard; Protein; 55 0 AA . 
XX 

AC AAB472 71; 
XX 



DT 06-AUG-2 001 (first entry) 
XX 

DE hOATl . 
XX 

KW Human; organic anion transporter; hOAT; liver; kidney; 

KW membrane protein; transport; organic anion; splice variant. 
XX 

OS Homo sapiens . 
XX 

FH Key 

FT Modified-site 
FT 

FT Modified-site 
FT 

FT Modified-site 
FT 
XX 

PN WO2 0 01042 83-A2 
XX 

PD 18-JAN-2001. 
XX 

PF 12-JUL-2000; 2 000WO-US18 98 0 . 
XX 

PR 12-JUL-1999; 99US-0143771 . 
XX 

PA (META-) ME TAB AS IS THERAPEUTICS INC. 
XX 

PI Sun W; 
XX 

DR WPI; 2001-367057/38. 

DR N-PSDB; AAC85819. 
XX 

PT Nucleic acids encoding human organic anion transporter polypeptides, 

PT useful in gene therapy procedures - 

XX 

PS Example 2; Fig 1; 95pp; English. 
XX 

CC The sequences given in AAB47271-76 represent human organic anion 

CC transporter (hOAT) polypeptides. hOAT polypeptides are preferentially 

CC expressed in the liver and kidneys of humans. OAT's are membrane 

CC proteins that facilitate the transport of organic anions across the 

CC cell membrane. The mechanism of transport is thought to be a secondary 

CC or tertiary active transport involving exchange of another organic anion. 

CC hOAT2A and hOAT2B are thought to be splice variants as they are 

CC identical except at the C-terminal end. hOAT proteins and the DNA 

CC encoding them, may be used in the prevention, treatment and diagnosis 

CC of diseases associated with inappropriate hOAT expression. 

XX 

SQ Sequence 55 0 AA; 

Query Match 100.0%; Score 2 849; DB 22; Length 550; 

Best Local Similarity 100.0%; Pred. No. 2.6e-2 81; 

Matches 55 0; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 1 MAFNDLLQQVGGVGRFQQIQVTLWLPLLLMASHNTLQNFTAAIPTHHCRPPADANLSKN 60 

MM II 1 1 III II II II MINIMI II MINIMI MM Mill II 1 1 II II I MM I 

Db 1 MAFNDLLQQVGGVGRFQQIQVTLVVLPLLLMASHNTLQNFTAA I PTHHCRPPADANLSKN 60 

Qy 61 GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 12 0 

I M M M M MMI M M M MMI MINIM MIMM M MM MMMM M 1 1 1 1 

Db 61 GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 12 0 

Qy 121 VTEWDLVCSHRALRQLAQSLYMVGVXLGAMVFGYLADRLGRRKVLILNYLQTAVSGTCAA 18 0 

NNNNNNNNNIINNNNNNNIINIIINIIIINIINIIIINI 
Db 121 VTEWDLVCSHRALRQLAQSLYMVGVXLGAMVFGYLADRLGRRKVLILNYLQTAVSGTCAA 18 0 

Qy 181 FAPNFPIYCAFRLLSGMALAGISLNCMTLNVEWMPIHTRACVGTLIGYVYSLGQFLLAGV 240 

I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 FA PNF P I Y C A FRLLS GMAL AG I S LNCMTLNVE WMP I HTRA C VGTL I GYVYSLGQ FLL A GV 24 0 

Qy 241 AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 300 



Location/Qualifiers 
271 

/note= "PKC phosphorylation site" 
278 

/notes "PKC phosphorylation site" 
284 

/note= "PKC phosphorylation site" 



1 1 1 1 1 II II 1 1! I M 1 1 1 1 III 1 1 1 Mill II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 

Db 241 AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 300 

Qy 301 GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 360 

I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III II 
Db 301 GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 360 

Qy 361 QGFGVS I YLIQVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGIC ILLNGVIPQDQS 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 Q GFGVS I YL I Q V I FGA VDL PA KL VGFL V I NS LGRRP AQMA AL LL AG I C I L LNGV I P QDQ S 420 

Qy 421 IVRTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGS I VSPLVSMTAE 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I II I I I I 

Db 421 IWTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGSIVSPLVSMTAE 480 

Qy 481 LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRKGKQTRQQQEHQKYMVPL 540 

II lllllllllll II IIIIIIIIIIIIMIIIIM I II II 1 1 1 ' : ' i : I I II 

Db 481 LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRKGKQTRQQQEHQKYMVPL 540 

Qy 541 QASAQEKNGL 55 0 

Illlllllll 
Db 541 QASAQEKNGL 550 



RESULT 3 
AAW88489 

ID AAW88489 standard; Protein; 563 AA . 
XX 

AC AAW88489; 
XX 

DT 05-MAR-1999 (first entry) 
XX 

DE Human organic anion transporter OAT-1 . 
XX 

KW Organic anion transporter; OAT-1; nephrotoxicity; drug release; 

KW drug -drug interaction mechanism; drug elimination; kidney; 

KW beta-lactam antibiotic; non-steroidal antiinflammatory; diuretic. 

XX 

OS Homo sapiens. 
XX 

PN WO9853064-A1. 
XX 

PD 26-NOV-1998. 
XX 

PF 18-MAY-1998; 98WO- JP02 171 . 
XX 

PR 23-MAY-1997; 97 JP- 0134 182 . 
XX 

PA (TANA ) TANABE SEIYAKU CO. 
XX 

PI Endou H, Hosoyamada M, Kanai Y, Sekine T; 
XX 

DR WPI; 1999-045310/04. 

DR N-PSDB; AAV79585 . 
XX 

PT New renal organic anion transporter protein - useful for, e.g. 

PT screening potential drugs for prevention of nephrotoxicity and as a 

PT reagent for the investigation of drug metabolism 

XX 

PS Claim 1; Page 2 7-32; 4 5pp; Japanese. 
XX 

CC The present sequence represents human organic anion transporter protein 

CC OAT-1. OAT-1 may be used as a reagent for the in vitro analysis and 

CC study of drug release and drug-drug interaction mechanisms and drug 

CC elimination via the kidney (e.g. for drugs such as beta-lactam 

CC antibiotics, non-steroidal antiinflammatories and diuretics) , and as 

CC a reagent for screening candidate drugs for the prevention of 

CC nephrotoxicity. Oligonucleotide sequences which hybridise to the OAT-1 

CC polynucleotide may be used as probes for detection of OAT-1 gene 

CC sequences . 



XX 

SQ Sequence 56 3 AA; 



Query Match 99.4%; Score 2832.5; DB 20; Length 563; 

Best Local Similarity 97.7%; Pred. No. 1.3e-279; 

Matches 55 0; Conservative 0; Mismatches 0; Indels 13; Gaps 1; 



yy 


J. 


m a punT .t .n n vf?nvaR pnn t hvtt ,wt , pt jTjLMA s hntlonft a a t pthttp p p p a n ant . ^ km 

i Y jj-i r iv u j_i j_i y ^ v ijtj v uA.r j.y v iu v vjjrjjiii j i ixn x xj yjM r xhm in ixxiv^rvir tr rixvrix^xjo x\xn 


60 






III II II III 1 1 1.1 Nil INI II II II II II 11 II lllllll MM II II II llll III 




Db 


1 


MAFNDLLQQVGGVGRFQQIQVTLV^PLLLMASHNTLQNFTAAIPTHHCRPPADANLSK^ 


60 


yy . 


6 1 


nnT.FVWT.PPnPnnnPF^rT.P FT^PftWftT.PFl^NGTEANGTGATFPC'TnO.WTYrjNSTFPSTT 


12 0 






IIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


61 


GGLE VWLPRDRQGQ PE SCLPvFTSPQ WGLPFLNGTE ANGTGATE P CTDGW I YDNSTFPST I 


12 0 


yy 


12 1 


VTP WTjT .VP c; pro a T .POT . AO *?T . YMVflVT .T.GAMVFGYTi A DP TiGPP KVTi T T iMYTjOTAV^GTGA A 

V ID TT lJ Xj V V— O niUlUIv^iJfl^OU X 11 V VJ V DiJVJnll V X VJ X XJ^i i J i\ i i v t n r\ r\ V XJ X XJ1N X Xjy X A V O VJ X V- ZA*V 


180 






III lllllll IIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIII llll MINI 




Db 


121 


VTE WDL VC S HRALRQL AQ SL YMVGVLLGAMVFGYL A DRLGRRKVL I LNYLQT A VS GT C A A 


180 


yy 


IS 1 


FS PWFP T VPa FPT J. C3flMa T.a f5T <?T.MPMTT.'KrVFWMP THTl? A PVfVT'T.T flYW^T .nDFT.T.AflV 

X i-iXTX'f F r 1 I V,rtf Kl'l' -^ItI'I MUHnl CJiJVi X XJ1M V Xj r» 11 XT X XI X lUlV. VV7X XJ X VJX V X DXJUy f i 1 1 imitm 


24 0 






1 1 1 M 1 1 1 1 1 1 1! 1! 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 M 




Db 


181 


FAPOTPI YCAFPiLSGMALAGI SLNCMTLNVE WMP IHTRACVGTL I GYVYSLGQFLLAGV 


240 


yy 


241 


a VaVPWWPTTT,nT,T.VC:apppaFPTYQWFFTF^APWW^^^flPT,TjT,TT,PaT,nPVaR TNiTFCFFF 


300 






MMMIMIMIMMIMMMMMIMIIIMIMIMIMIMIIIIIIMIMI 




Db 


241 


AYAVPH^RHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 


300 


yy 


i m 


Cia TfTrQMFVT.Pa c:T.nirFT.TMniT?<"ina <5aMFT.T.FnPTT.RWT,FT,PT.C!MT.WFaT<?FaYV(Tr.VMnT. 

kj.fi -TVXj 0 1*1 Xj V XjIXjH OXjy IYXj XJ X I'l^JT^Vjy /i 0>ilMXJ XJXJX\.l_ ir X XJlUlXjr XJl_XJOl T lXjVVr n X Of ±i. X 1 V7XJ V riUxj 


360 






II! Iliill III ll< III II II III ill 111:111 IIMI II 




nk 
JJD 


J vl 


VT.CM'PTrr.D a CT.ri V'FT.TMfl'If'f'iria CaMVT.T.PPDTT.PWT.T7T.PT.5MT.MT7aTc;pa WfTT A/MTV!". 
xVLi o 1*1 xj VXjKjfioxjy xvljXj 1 rlLyJWjyrt oA1UxjxjxjJX.v_. ir X xjrvJll.i x xj v_ xj o 1 v 11j WrH 1 o " /i I 1 Ltxj VI'IUxj 


36 0 


Qy 


361 


QGFGVS I YL I QVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI C I LLNGVI PQDQS 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


QGFGVS I YL I QVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI C I LLNGVI PQDQS 


420 


Qy 


421 


I VRTSLAVLGKGCLAASFNC I FLYTGELYPTMI RQTGMGMGSTMARVGS IVSPLVSMTAE 


480 






MIMMMIMIIIIIIIMIMIIIMMMIIMIMMMMIMMIIMIMM 




Db 


421 


I VRTSLAVLGKGCLAASFNC I FLYTGELYPTMI RQTGMGMGSTMARVGS IVSPLVSMTAE 


480 


Qy 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLES RKGKQT 


527 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 M 1 1 1 1 1 1 1 1 i 1 1 1! 1 1 llllll 




Db 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRWAPTQKEAGIYPRKGKQT 


540 


Qy 


528 


RQQ QE HQ K YMVP LQ A S AQE KNGL 55 0 

1 1 1 1 Ml 1 1 1 M M II IMI II! 




Db 


541 


RQQ QE HQ K YMVP LQ A S AQE KNGL 56 3 





Issued: 

SUMMARIES 

% 

Result Query 



> . 


Score 


Match Length 


DB 


. ID 








Description 




1 


2849 


100 . 6 


550 


4 


US- 


09 


-330 


-245A-2 


Sequence 2, 


Appli 


2 


2463 .5 


86 .5 


545 


4 


US- 


09 


-572 


-147-2 


Sequence 2 , 


Appli 


3 


1371 


48 .1 


537 


2 


US- 


08 


-647 


-397-2 


Sequence 2 , 


Appli 


4 


704 


24 .7 


556 


3 


US- 


08 


-501 


-572-1 


Sequence 1, 


Appli 


5 


704 


24 .7 


556 


3 


US- 


09 


-040 


-444-1 


Sequence 1, 


Appli 


6 


699.5 


24.6 


555 


3 


US- 


08 


-501 


-572-3 


Sequence 3 , 


Appli 


7 


699.5 


24 .6 


555 


3 


US- 


09 


-040 


-444-3 


Sequence 3 , 


Appli 


8 


674 


23.7 


553 


3 


US- 


08 


-501 


-572-2 


Sequence 2, 


Appli 


9 


674 


23.7 


553 


3 


US- 


09 


-040 


-444-2 


Sequence 2, 


Appli 



RESULT 2 
US-09-572-147-2 

; Sequence 2, Application US/09572147 



Patent No. 6420544 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 



Lin Yue 
John Feild 
Harma Ellens 

TITLE OF INVENTION: POLYNUCLEOTIDE AND POLYPEPTIDE SEQUENCES 

TITLE OF INVENTION: ENCODING MURINE ORGANIC ANION TRANSPORTER 5 (mOATPS)- AND 
TITLE OF INVENTION: SCREENING METHODS THEREOF 
FILE REFERENCE: GP-7 0622 

CURRENT APPLICATION NUMBER: US/09/572 , 147 
CURRENT FILING DATE: 2000-05-17 
PRIOR APPLICATION NUMBER: 60/134,879 
PRIOR FILING DATE: 1999-05-19 
NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 545 
TYPE: PRT 

ORGANISM: MUS MUSCULUS 
US-09-572-147-2 

Query Match 86.5%; Score 2463.5; DB 4;. Length 545; 

Best Local Similarity 85.1%; Pred. No. 8.9e-251; 

Matches 469; Conservative 41; Mismatches 34; Indels 7; Gaps 2; 
Qy 1 MAFNDLLQQVGGVGRFQQIQVTLVVLPLLLMASHNTLQNFTAA I PTHHCRPPADANLSKN 60 

. II 1 1 1 1 hi 1 1 1 1 II I Nihil II Mil II II INI Mill III II I hi III : 

Db 1 MAFNDLLKQVGGVGRFQLIQVTMWAPLLLMASHNTLQNFTAAIPAHHCRPPANANLSKD 6 0 

Qy 61 GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 12 0 

II I I Ml hlhlllllll I I MM llhllllllllll 

Db 61 GGLEAWLPLDKQGRPESCLRFPFPH NGTEANGTGVTEPCLDGWVYDNSTFPST I 114 

Qy 121 VTEWDLVCSHRALRQLAQSLYMVGVLLGAMVFGYLADRLGRRKVLILNYLQTAVSGTCAA 18 0 

Mlhlllllll MMMhlMMMIhlMMMMMMMIMMMIMIIM 

Db 115 VTEWNLVCSHRAFRQLAQSLFMVGVLLGAMMFGYLADRLGRRKVLILNYLQTAVSGTCAA 174 

Qy 181 F A PNF P I Y C A FRLL SGMAL A G I SLNCMTLNVE WMP I HTRA C VGTL I GYVY S LGQ FLL AGV 24 0 

Mlh Ml III Ml hi I hMIMIhllllMIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

Db 175 Y A PNYTVYC I FRLL SGMS L A S I A I NCMTLNME WMP I HTRA YVGTL I GYVY SLGQFLLAGI 234 

Qy 241 A Y A VPHWRHLQ L L VS A PFFAFFIYSWFFIE S ARWHS S S GRLDL TL RAL Q RVA R I NGKRE E 300 

I I I I I II I I II I II II I I I I I II I I II II h II I I I I I II I I I I I I I I I I I I h I I 
Db 2 35 AYAVPHWRHLQLAVSVPFFVAFIYSWFFIESARWYSSSGRLDLTLRALQRVARINGKQEE 2 94 

Qy 301 GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 360 

1 1 1 1 h 1 1 h IIMIIh I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III llhlh 1 1 I 1 1 I II II III 

Db 2 95 GA KL S I E VL Q TS LQ KE LTLNKGQ A S AME LLRC PTLRRL FL CL S ML W FATS FA Y YGL VMDL 354 

Qy 361 QGFGVS I YL I QVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI C I LLNGVI PQDQS 420 

I I I I I I : 1 I I I I I I I I I I I I I. I I I I I I I hi I I I I hhl I I I I I I I hi hi h : 

Db 355 QGFGVSMYL I QVI FGAVDLPAKFVCFLVINSMGRRPAQLASLLLAGI C I LVNGI I PRGHT 414 

Qy 421 IVRTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGSIVSPLVSMTAE 48 0 

hi I I I II I I I I I I hi M II I II I I I I I I I I III I hi I I I I II I I I I I I I I hll I I I 
Db 415 I IRTSLAVLGKGCLASSFNCIFLYTGELYPTMIRQTGLGMGSTMARVGS IVSPLISMTAE 474 

Qy 481 LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESR-KGKQTRQQQEHQKYMVP 539 

1 1 hi 1 1 hill I II 1 1 1 1 1 1 1 Ml II III I Ml 1 1 hi I MM Ml I II hi 

Db 475 FYPSIPLFIFGAVPVAASAVTALLPETLGQPLPDTVQDLKSRSRGKQKQQQLEQQKQMIP 534 

Qy 54 0 LQASAQEKNGL 55 0 

II I MMII 

Db 535 LQVSTQEKNGL 54 5 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 
J. 


I A3 


A z> 


A 
. *± 


j^j 


A 


.TP A ft ft A 


uiy cm x « x uii j. ct. 


A 


/US 


1 A 
A 


q 


^ c 
D D D 


A 




ULyaiiik- ^aLiuu i~a.ci 


3 


644.5 


A A 


. b 


3 / b 


A 




aypuLiicLicai pxv?i~e 


4 


641 


22 


m 5 


557 


2 


JW0 08 9 


organic cat ion tra 


5 


617 .5 


21 


.7 


557 


2 


JE0346 


high-affinity earn 


6 


450.5 


15 


.8 


794 


2 


T27870 


hypothetical prote 


7 


450 


15 


.8 


745 


2 


T16565 


hypothetical prote 


8 


434 .5 


15 


.3 


539 


2 


C96758 


probablle protein 


9 


408 .5 


14 


.3 


515 


2 


B96825 


hypothetical prote 


10 


401.5 


14 


.1 


527 


2 ■ 


T01019 


transport protein 


11 


395 .5 


13 


.9 


. 447 


2 


D89646 


protein ZK4 55 .8 [i 


12 


391 


13 


.7 


521 


2 


H86298 


hypothetical prote 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


734.5 


25 


.8 


556 


1 


0CN3_HUMAN 


075751 


homo sapien 


2 


696.5 


24 


.4 


551 


1 


0CN3_M0USE 


Q9wtw5 


mus musculu 


3 


696 .5 


24 


.4 


551 


1 


• 0CN3_RAT 


088446 


rattus norv 


4 


641 


22 


.5 


557 


1 


0CN2_HUMAN 


076082 


homo sapien 


5 


639.5 


22 


.4 


557 


1 


0CN2_M0USE 


Q9z0e8 


mus musculu 


6 


630.5 


22 


.1 


557 


1 


0CN2_RAT 


070594 


rattus norv 


7 


400.5 


14 


.1 


464 


1 


YT13_CAEEL 


Q10917 


caenorhabdi 


8 


387 


13 


.6 


529 


1 


Y0U1_CAEEL 


P30638 


caenorhabdi 


9 


352 .5 


12 


.4 


516 


1 


B0CT_RAT 


Q9p290 


rattus norv 



Result 
No. 



Score 



% 

Query 

Match Length DB 



SUMMARIES 



ID 



Description 



1 


2849 


100 . 


. 0 


550 


4 


Q8N192 


Q8nl92 homo sapien 


2 


2832 .5 


99 


.4 


563 


4 


095742 


095742 homo sapien 


3 


2600 


91 . 


. 3 


506 


4 


Q9NQC2 


Q9nqc2 homo sapien 


4 


2583 .5 


90, 


. 7 


519 


4 


Q9NQA6 


Q9nqa6 homo sapien 


5 


2571.5 


90 , 


.3 


551 


6 


Q9TSY7 


Q9tsy7 oryctolagus 


. 6 


2549.5 


89 


.5 


551 


11 


035956 


035956 rattus norv 


7 


2524.5 


88 , 


.6 


547 


6 


Q8MK48 


Q8mk48 sus scrofa 


8 


2463.5 


86. 


.5 


545 


11 


Q8VC6 9 


Q8vc6 9 mus musculu 


9 


2449.5 


86. 


,0 


545 


11 


Q61185 


Q6118 5 mus musculu 


10 


2437.5 


85. 


.6 


533 


6 


Q8MK47 


Q8mk47 sus scrofa 


11 


1421 


49. 


.9 


542 


6 


Q8HY24 


Q8hy24 oryctolagus 


12 


1416 


49. 


.7 


542 


4 


Q8TCC7 


Q8tcc7 homo sapien 



RESULT 2 
095742 

ID 095742 PRELIMINARY; PRT; 563 AA . 

AC 095742; 095187; Q9UEQ8 ; Q9UBG6 ; 

DT 01-MAY-1999 (TrEMBLrel. 10 , Created) 

DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update) 

DT 01-MAR-2 003 (TrEMBLrel. 23, Last annotation update) 

DE Renal organic anion transport protein 1. 

GN SLC22A6 OR 0AT1 OR R0AT1 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

0C Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX ' NCBI_TaxID=96 06; 

RN [1] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC TISSUE =KIDNEY; 

RX MEDLINE=99103997; PubMed=9887087 ; 

RA Hosoyamada M., Sekine T. Kanai Y. , Endou H. ; 

RT "Molecular cloning and functional expression of a multispecif ic 

RT organic anion transporter from human kidney. "; 

RL Am. J. Physiol. 276 : F122 -F128 (1999) . 

RN [2] 



RP SEQUENCE FROM N.A. 

RX MEDLINE=98433806; PubMed=9762842 ; 

RA Reid G., Wolff N.A., Dautzenberg F.M., Burckhardt G: ; 

RT "Cloning of a human renal p-aminohippurate transporter, hROATl . " ; 

RL Kidney Blood Press. Res. 21:233-237(1998). 

RN C3] 

RP SEQUENCE FROM N.A. 

RC TISSUE =KIDNEY; 

RX MEDLINE=99137667; PubMed=995 0961 ; 

RA Lu R. , Chan B.S., Schuster V.L.; 

RT "Cloning of the human kidney PAH transporter: narrow substrate 

RT specificity and regulation by protein kinase C." ; 

RL Am. J. Physiol. 276 : F295 -F303 (1999) . 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE =KIDNEY; 

RX MEDLINE=99160894; PubMed=10049739 ; 

RA Race J.E., Grass 1 S.M., Williams W.J., Holtziaan E.J.; 

RT "Molecular cloning and characterization of two novel human renal 

RT organic anion transporters (hOATl and hOAT3 ) . " ; 

RL Biochem. Biophys . Res. Commun. 255:508-514(1999). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE = KIDNEY; 

RX MEDLINE=99393620; PubMed=10462 545 ; 

RA Cihlar T. f Lin D.C., Pritchard J.B., Fuller M.D., Mendel D.B., 

RA Sweet D.H. ; 

RT "The antiviral nucleotide analogs cidofovir and adefovir are novel 

RT substrates for human and rat renal organic anion transporter 1."; 

RL Mol. Pharmacol. 56:57 0-580(1999).. 

RN [6] 

RP SEQUENCE FROM N.A. 

RA Bahn A., Prawitt D. # Reid G., Enklaar T., Wolff N.A., Hillemann A., 

RA Godehardt S., Buttler D. f Knabe M., Schulten H.J., Gunawan B., 

RA Fuezesi L., Zabel B., Burckhardt G. ; 

RT "Genomic cloning and characterization of the human renal organic anion 

RT transporter gene ( hOATl ) . " / 

RL Submitted (SEP-1999) to the EMBL/GenBank/DDBJ databases. 

CC -1- FUNCTION: MEDIATES THE NA (+) -INDEPENDENT TRANSPORT OF ORGANIC 

CC ANIONS SUCH AS P-AMINOHIPPURATE AND ALPHA - KETOGLUTARATE . 

CC - ! - SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN. BASOLATERAL 

CC MEMBRANE (POTENTIAL) . 

CC -!- ALTERNATIVE PRODUCTS: 2 ISOFORMS; HOATl -1 (SHOWN HERE) AND HOATl - 
CC 2; ARE PRODUCED BY ALTERNATIVE SPLICING. 

CC -!- TISSUE SPECIFICITY: STRONGLY EXPRESSED IN KIDNEY. ALSO DETECTED IN 
CC BRAIN AND AT LOW LEVELS, IN SKELETAL MUSCLE AND PLACENTA. NOT 

CC . EXPRESSED IN HEART, LUNG, LIVER, PANCREAS, COLON, THYMUS, SMALL 

CC INTESTINE OR PERIPHERAL BLOOD LEUKOCYTES. IN THE KIDNEY, STRONGLY 

CC EXPRESSED IN PROXIMAL TUBULE CELLS. 

CC - ! - PTM : GLYCOSYLATED . 

CC -!- SIMILARITY: BELONGS TO THE SLC22 FAMILY OF TRANSPORTERS. 

DR EMBL / AF05 7 03 9; AAC7 0004.1; -. 

DR EMBL; AB0096 97; BAA75 072.1; 

DR EMBL; AF104038; AAD10052.1; -. 

DR EMBL; AB009698; BAA75073.1; -. 

DR EMBL; AF097490; AAD19356.1; -. 

DR EMBL; AF124373; AAD55 35 6.1; -. 

DR EMBL; AJ249369; CAB77184.1; -. 

DR Genew; HGNC:10970; SLC22A6. 

DR InterPro; IPR007114; MFS. 

DR InterPro; IPR00474 9; Orgcat_transp . 

DR InterPro; IPR005828; Sub_transporter . 

DR Pfam; PF00083 ; sugar_tr; 1 . 

DR TIGRFAMs; TIGR008 98; 2 AO 11 9; 1. 

DR PROSITE; PS50850; MFS; 1. 

KW Transmembrane; Transport; Alternative splicing; Ion transport; 

KW Glycoprotein. 

FT TRANSMEM 13 6 156 POTENTIAL. 

FT TRANSMEM 185 2 05 POTENTIAL. 

FT TRANSMEM 249 2 69 POTENTIAL. 

FT TRANSMEM 338 3 58 POTENTIAL. 



FT 


TRANSMEM 


396 416 POTENTIAL. 




FT 


TRANSMEM 


426 446 POTENTIAL. 




FT 


TRANSMEM 


485 505 POTENTIAL. 




FT 


CARBOHYD 


39 39 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


C el CC XT T T / HWh C* \ I TjriT'TJXTT T 7i T \ 

56 56 N-LINKriD ^liLiLJMAL . . .) \ rU xEiIn 1 1 Axj J . 




FT 


CARBOHYD 


92 92 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


97 97 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


CARBOHYD 


113 113 N-LINKED (GLCNAC. . .) (POTENTIAL). 




FT 


VARSPLIC 


523 535 MISSING (IN ISOFORM HOAT1-2) . 




FT 


CONFLICT 


14 14 G -> S (IN REF. 3) . 




FT 


CONFLICT 


563 563 L -> F (IN REF. 2) . 




SQ 


SEQUENCE 


563 AA; 61816 MW; 74AD3EA2 678 032E4 CRC64 ; 




Query Match 


99.4%; Score 2 832.5; DB 4; Length 563; 




Best Local Similarity 97.7%; Pred. No. 4.4e-227; 




Matches 55 0; Conservative 0; Mismatches 0; Indels 13; Gaps 


1 


Qy 


± 


m a TTTjnT t nrt\Tnn\movrsr\ Tr»T7 r PTAnrr.'DT.T.T.Ma c umtt .nwcT 1 a a TDfWunjDDanaMT.cmj 

ClAr NUJ-iijyy Vv»V» V VjXvT yy iy V l Xj V V ljjr LiLi±jrLii i3 rLLNJ X XjyiNr I ii-H Iri HJl^ Klr^HlJ/ilNl J_i o 


60 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


1 


MAFNDLLQQVGGVGRFQQ IQVTLWLPLLLMASHNTLQNFTAA I PTHHCRPPADANLSKN 


60 


Qy 


6 1 




120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 


12 0 


Qy 


1 <51 
XZ ± 


XT^TJTaTTit T7V"CirD AT OAT A ACT VTwn7r*T7T T f AMVCPVT ATIT3T f~ , 'D'D\r\7~[ TT TvTVT AT" AT TCI flT* HA A 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


VTE WDLVCS HRALRQLAQSL YMVGVLLGAMVFGYLADRLGRRKVL I LNYLQTA VSGTCA A 


180 


Qy 


181 


FAPNFP I YCAFRLLSGMALAGI SLNCMTLNVE WMP IHTRACVGTL IGYVYSLGQFLLAGV 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 




JJXJ 


lOl 


r Acvir C ± ILAf XUjIjO ovin JjaV7X oXjIN^I'I X XjXV VJCi WFlr^ XI1X XvnV^ VI? X Xj Xur X V X ojjoi^r XjXjAVj V 


240 


Qy 


241 


AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 


300 






MINIMI 1 IIIIIIIIIMIIIIII III! II II II II MM 1 1 1 MM III 1 III 




Db 


241 


AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 


300 


Qy 


301 


GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 


360 






M M 1 1 1 1 1 1 II 1 Ml 1 II 1 MM II 1 1 1 II 1 II 1 II II 1 II 1 1 II II 1 1 1 1 1 II II 




Db 


301 


GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 


360 


Qy 


361 


QGFGVS I YL I QVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI C I LLNGVI PQDQS 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


QGFGVS I YL I QVI FGAVDLPAKLVGFLVINSLGRRPAQMAALLLAGI C I LLNGVI PQDQS 


420 


Qy 


421 


I VRTSLA VLGKGCLAAS FNC I FL YTGE L YPTM I RQTGMGMGSTMARVGS I VS PLVSMTAE 


480 






1 i 1 1 1 I 1 1 1 I 1 1 1 1 1 I 1 f 1 1 1 i 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 t 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 1 




Db 


421 


IVRTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGSIVSPLVSMTAE 


480 


Qy 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLES RKGKQT 


527 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINI 




Db 


481 


LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESRWAPTQKEAGIYPRKGKQT 


540 


Qy 


528 


RQQQEHQKYMVPLQASAQEKNGL 550 
IMIIMI 1 1 1 II II II II MM 




Db 


541 


RQQQEHQKYMVPLQASAQEKNGL 563 





RESULT 6 
035956 

ID 035956 PRELIMINARY; PRT; 551 AA . 

AC 035956; 

DT 01-JAN-1998 (TrEMBLrel . 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Renal organic anion transport protein 1 . 

GN SLC2 2A6 OR R0AT1 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



oc 
ox 

RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 
RN 
RP 
RC 
RX 
RA 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 
NCBI_TaxID=10116; 
[1] 

SEQUENCE FROM N.A., FUNCTION, AND TISSUE SPECIFICITY. 
TISSUE=KIDNEY; 

MEDLINE=98043701; PubMed=9374486 ; 
Sweet D.H., Wolff N.A., Pritchard J.B.; 

"Expression cloning and characterization of R0AT1 . The basolateral 
organic anion transporter in rat kidney."; 
J. Biol. Chem. 272:30088-3 0095(1997). 
[23 

SEQUENCE FROM N.A., FUNCTION, AND TISSUE SPECIFICITY. 
STRA IN= S PRAGUE - DAWLE Y ; TISSUE= KIDNEY; 
MEDLINE=9737353 9; PubMed=922 8 014 ; 

Sekine T., Watanabe N., Hosoyamada M., Kanai Y., Endou H.; 
"Expression cloning and characterization of a novel multispecif ic 
organic anion transporter."; 
J. Biol. Chera. 272:18526-1852 9(1997). 

-!- FUNCTION: MEDIATES THE NA (+) -INDEPENDENT TRANSPORT OF ORGANIC 

ANIONS SUCH AS CYCLIC NUCLEOTIDES, PROSTAGLANDIN E2 AND URIC ACID. 

-I- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN. BASOLATERAL 
MEMBRANE (POTENTIAL) . 

-!- TISSUE SPECIFICITY: STRONG EXPRESSION IN KIDNEY. VERY WEAK 

EXPRESSION IN BRAIN. NOT DETECTED IN HEART, LUNG, LIVER, SPLEEN 
SKELETAL MUSCLE, SMALL INTESTINE, LARGE INTESTINE, EYE OR TESTIS. 
EXPRESSED IN THE PROXIMAL TUBULE IN THE KIDNEY. 

-1- PTM: GLYCOSYLATED (BY SIMILARITY). 

-!- SIMILARITY: BELONGS TO THE SLC22 FAMILY OF TRANSPORTERS. 

EMBL; AF008221; AAC18772.1; -. 

EMBL; AB00455 9; BAA22 086.1; -. 

InterPro; IPR007114; MFS . 

InterPro; IPR00474 9; Orgcat_transp . 

InterPro; I PRO 05 82 8; Sub_transporter . 

Pfam; PF00 083; sugar_tr; 1. 

TIGRFAMs; TIGR0 08 98; 2A0119; 1. 

PROSITE; PS5 085 0; MFS; 1. 

Glycoprotein ; 
POTENTIAL . 
POTENTIAL . 
POTENTIAL . 
POTENTIAL. " 
POTENTIAL . 
POTENTIAL . 
N-LINKED (GLCNAC. 
N- LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 
N-LINKED (GLCNAC. 



Transmembrane; Transport; 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
TRANSMEM 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
CARBOHYD 
SEQUENCE 



Ion transport . 



136 
196 
249 
338 
396 
485 

39 

56 

92 

97 
113 
184 
551 AA; 



156 
216 
269 
358 
416 
505 
39 
56 
92 
97 
113 
184 
60766 MW; 



(POTENTIAL) . 
(POTENTIAL) . 
(POTENTIAL) . 
(POTENTIAL) . 
(POTENTIAL) . 
(POTENTIAL) . 



8BA47BE628324BF2 CRC64; 



Query Match 8 9.5%; Score 2 549.5; DB 11; Length 551; 

Best Local Similarity 87.8%; Pred. No. 1.5e-2 03; 

Matches 484; Conservative 35; Mismatches 31; Indels 1; 



Gaps 



Qy 



Db 



1 MA FNDLL QQ VGGVGRFQ Q I Q VTL WL PL LLMA SHNTLQNFT A A I P THHC RP PA D ANL S KN 6 0 

Illlllhlllllllll Nihil lllllllllllllllllll llllllhllllh 

1 MAFNDLLKQVGGVGRFQLIQVTMVVAPLLLMASHNTLQNFTAAIPPHHCRPPANANLSKD 6 0 



Qy 



Db 



61 GGLE VWL P RDRQGQP E SCLRFTSPQ WGL P FLNGTE ANGTGATE P CTDGW I YDNS TF P S T I 12 0 

I I I I Ml hllllllllllllllll II llllllll I I I I llhllllllllll 
61 GGLEAWLPLDKQGQPESCLRFTSPQWGPPFYNGTEANGTRVTEPCIDGWVYDNSTFPSTI 12 0 



Qy 



Db 



121 VTEWDLVCSHRALRQLAQSLYMVGVLLGAMVFGYLADRLGRRKVLILNYLQTAVSGTCAA 18 0 

Illhlllllll I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 VTE WNLVCSHRAFRQLAQSLYMVGVLLGAMVFGYLADRLGRRKVL I LNYLQTAVSGTCAA 180 



Qy 181 F A PNF P I YC A FRLLS GMAL A G I S LNCMTLNVE WM P I HTRA C VGTL I G YVYS LGQ FL L A GV 2 40 

:Hh - I I llllllhll h:|IIMIIIIIIIIIII I I: I - I II 
Db 181 YAPNYTVYCVFRLLSGMSLASIAINCMTLNVEWMPIHTRAYVGTLIGYVYSLGQFLLAGI 240 



Qy 

Db 



241 AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 3 00 

llllll II 111 = 11 III llllll I I V :| III || II I I I I I I III I I I I U I 
241 AYAVPHWRHLQLWSVPFFIAFIYSWFFIESARWYSSSGRLDLTLRALQRVARINGKQEE 300 



Qy 301 GAKLSMEVXRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 36 0 

IIM|:|IM llllllh llllll I'M: llllll II I I I I I I I I I I I I I I I I I I 

Db 3 01 GAKLSIEVLRTSLQKELTLSKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 36 0 

Qy 361 QGFGVS I YL I Q VI FGA VDLPA KLVGFLV I NSLGRRPAQMAALLLAGI C I LLNGV I PQDQ S 420 

Illllhlllllllllllllll I lllllhlllllllhllllllllhlhlh : 
Db 361 QGFGVSMYL IQVI FGAVDLPAKFVCFLVINSMGRRPAQMASLLLAGI C I LVNGI I PKSHT 42 0 

Qy 421 IVRTSLAVLGKGCLAASFNCI FLYTGELYPTMIRQTGMGMGSTMARVGS I VSPLVSMTAE 48 0 

hi II I I II lllllhl I I I I I II I I I I I I hi I II hll II II I I I I I III I I I I I I I I 
Db 421 I IRTSLAVLGKGCLASSFNCIFLYTGELYPTVIRQTGLGMGSTMARVGS I VSPLVSMTAE 480 

Qy 481 LYPSMPLFIYGAVPVAASAVTVLLPETLGQPLPDTVQDLESR-KGKQTRQQQEHQKYMVP 539 

Illllllhlllll I I I I I I I I II I M II II I I I I h :||| :|||| M hi 
Db 481 FYPSMPLF I FGAVPWASAVTALLPETLGQPLPDTVQDLKSRSRGKQNQQQQEQQKQMMP 540 

Qy 54 0 LQASAQEKNGL 55 0 

I I I I llllll 
Db 541 LQASTQEKNGL 551 

RESULT 9 
Q61185 

ID Q61185 PRELIMINARY; PRT; 545 AA . 

AC Q61185; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01 -NOV- 19 96 {TrEMBLrel . 01, Last sequence update) 

DT 01-MAR-2 0 03 (TrEMBLrel. 23, Last annotation update) 

DE Kidney-specific transport protein. 

GN SLC22A6. 

OS Musmus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N. A . 

RC STRAIN=BALB/c; 

RX MEDLINE=97197822; PubMed=9 04 56 72 ; 

RA Lopez -Nieto C.E., You G., Bush K.T., Barros E.J., Beier D.R., 

RA Nigam S . K . ; 

RT "Molecular cloning and characterization of NKT, a gene product related 

RT to the organic cation transporter family that is almost exclusively 

RT expressed in the kidney."; 

RL J. Biol. Chem. 272:6471-6478(1997). 

CC -!- SUBCELLULAR LOCATION: INTEGRAL MEMBRANE PROTEIN (BY SIMILARITY). 

DR EMBL; U52842; AAC53112.1; -. 

DR MGD; MGI: 892001;. Slc22a6 . 

DR InterPro; IPR007114; MFS . 

DR InterPro; IPR00474 9; Orgcat_transp . 

DR InterPro; IPR00582 8; Sub_transporter . 

DR Pfam; PF00083; sugar_tr; 1. 

DR TIGRFAMs; TIGR00898; 2A0119; 1. 

DR PROSITE; PS50850; MFS; 1. 

KW Transmembrane. 

SQ SEQUENCE 545 AA; 59983 MW; CD3B762E5 984 933A CRC64; 

. Query Match 86.0%; Score 244 9.5; DB 11; Length 545; 

Best Local Similarity 84.9%; Pred. No. 3e-195; 

Matches 4 68; Conservative 41; Mismatches 35; Indels 7; Gaps 2; 
Qy 1 MAFNDLLQQVGGVGRFQQ IQVTLVVLPLLLMASHNTLQNFTAAI PTHHCRPPADANLSKN 60 

ii 1 1 1 1 hi 1 1 1 1 1 1 1 1 nihil ii mm ii ii ii ii nil i iiiiiihinih 

Db 1 MAFNDLLKQVGGVGRFQL I QVTMVVAPLLIjMASHNTLQNFTAA I PAHHCRPPANANLSKD 6 0 

Qy. 61 GGLEVWLPRDRQGQPESCLRFTSPQWGLPFLNGTEANGTGATEPCTDGWIYDNSTFPSTI 12 0 

MM II hlhllMIII I IMMIIM MM MhlMMIMM 

Db 61 GGLEARLPLDKQGRPESCLRFPFPH NGTE ANGTGVTE P CLDGWVYDNSTFPST I 114 



Qy 


12 1 


Db 


115 


Qy 


181 


Db 


175 


Qy 


241 


Db 


2 35 


Qy 


3 01 


T-\\_ 

DD 


2 95 


Qy 




Db 


355 


Qy 


421 


Db 


415 


Qy 


481 


Db 


475 


Qy 


540 


Db 


535 



VTEWDLVCSHRALRQLAQSLYWGVLLGAIWFGYLADRLGRRKVLILNYLQTAVSGTCAA 180 

lllhlllllll I Mil I hll I I I I llhl III I I II I I I I I II III II III I I I II I 
VTE WNLVCSHRAFRQLAQSLFMVGVLLGAMMFGYLADRLGRRKVL I LNYLQTAVSGTC AA 174 

FAPNFP I YCAFRLLSGMALAGI SLNCMTLNVE WMP IHTRACVGTL I GYVYSLGQFLLAGV 24 0 

= llh =11 I I II I I hll |::| II I I hi I I III II I I I I I I I I I I II I I I I I I h 
YA PNYTVYC I FRLLSGMSLAS I A I NCMTLNME WMP I HTRA YVGTL I GYVYSLGQFLLAG I 2 34 

AYAVPHWRHLQLLVSAPFFAFFIYSWFFIESARWHSSSGRLDLTLRALQRVARINGKREE 300 

llllllllllll II III' II 1 1 If II II I IM f 1 1 II 1 1 1 1 1 III 1 1 II! M M 

AYAVPHWRHLQLAVSVPFFVAFIYSWFFIESARWYSSSGRLDLTLRALQRVARINGKQEE 2 94 
GAKLSMEVLRASLQKELTMGKGQASAMELLRCPTLRHLFLCLSMLWFATSFAYYGLVMDL 36 0 

•ii 1 1 hi i h i ii nih mi ii ii mi ii ii iiiiiiimmimiiiiii 

GAKLS IEVLQTSLQKELTLNKGQASAMELLRCPTLRRLFLCLSMLWFATSFAYYGLVMDL 354 

QGFGVS I YL I QV I FGA VDLPA KLVGFLVINS LGRRPAQMAALLLAGI C I LLNGV I PQDQ S 420 

lllllhlllllllllllllll I I I I II h I I I I I I : I :| I I I I I I I h I h I h : 
QGFGVSMYL I QVI FGAVDLPAKFVCFLVINSMGRRPAQLASLLLAGI C ILVNGI IPRGHT 414 

IVRTSLAVLGKGCLAASFNCIFLYTGELYPTMIRQTGMGMGSTMARVGSIVSPLVSMTAE 480 

hi III II II II II hll II II llll II II II II II hi llllllllllll llhl I II I 
IIRTSLAVLGKGCLASSFNCIFLYTGELYPTMIRQTGLGMGSTMARVGSIVSPLISMTAE 474 

LYPSMPLFI YGAVPVAASAVTVLLPETLGQPLPDTVQDLESR-KGKQTRQQQEHQKYMVP 539 

iihiiihiiiniinii ihiiiiiiimiiihii nn >n i ii hi . 

FYPSIPLFIFGAVPVAASAVTALLPETLGQPLPDTVQDLKSRSRGKQKQQQLEQQKQMIP 534 



II I lllll 
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